Joint Subtitle Extraction and Frame Inpainting for Videos with Burned-In Subtitles


Abstract

Subtitles are crucial for video content understanding. However, a large number of videos have only burned-in, hardcoded subtitles that prevent re-editing, translation, etc. In this paper, we construct a deep-learning-based system for the inverse conversion of a burned-in subtitle video to a subtitle file and an inpainted video, by coupling three deep neural networks (CTPN, CRNN, and EdgeConnect). We evaluated the performance of the proposed method and found that the deep learning method achieved high-precision separation of the subtitles and video frames and significantly improved the inpainting results compared to existing methods. This research fills a gap in the application of deep learning to burned-in subtitle video reconstruction and is expected to be widely applied to other video re-editing with subtitles, advertisements, logos, and other occlusions.
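As an illustration of the "subtitle file" half of the conversion, the following is a minimal sketch, not the authors' implementation. It assumes that a detector and recognizer (for example CTPN followed by CRNN) have already produced one recognized string per frame, with `None` where no subtitle is present, and that the output format is SubRip (SRT); the abstract does not state either detail. The names `Cue`, `frames_to_cues`, `srt_timestamp`, and `to_srt` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Cue:
    start: float  # cue start time in seconds
    end: float    # cue end time in seconds
    text: str


def frames_to_cues(texts: List[Optional[str]], fps: float) -> List[Cue]:
    """Merge runs of consecutive frames with identical text into cues.

    `texts` is assumed to hold one recognized string per frame
    (None or "" meaning no subtitle was detected in that frame).
    """
    cues: List[Cue] = []
    run_text: Optional[str] = None
    run_start = 0
    # A trailing None sentinel flushes the last open run.
    for i, text in enumerate(list(texts) + [None]):
        if text != run_text:
            if run_text:
                cues.append(Cue(run_start / fps, i / fps, run_text))
            run_text, run_start = text, i
    return cues


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def to_srt(cues: List[Cue]) -> str:
    """Serialize cues as numbered SRT blocks separated by blank lines."""
    blocks = [
        f"{n}\n{srt_timestamp(c.start)} --> {srt_timestamp(c.end)}\n{c.text}\n"
        for n, c in enumerate(cues, 1)
    ]
    return "\n".join(blocks)


if __name__ == "__main__":
    # Hypothetical per-frame recognition output at 2 frames per second.
    per_frame = ["Hi", "Hi", None, "Bye", "Bye", "Bye"]
    print(to_srt(frames_to_cues(per_frame, fps=2.0)))
```

In a real pipeline the per-frame strings would be noisy, so exact string equality would likely need to be relaxed (for instance with an edit-distance threshold) before runs are merged. The detected text boxes would also serve as the mask handed to the inpainting network.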


Related resources

Visual Subtitles for Internet Videos

We present a visual aid for the hearing impaired to enable access to internet videos. The visual tool is in the form of a time synchronized lip movement corresponding to the speech in the video which is embedded in the original internet video. Conventionally, access to the audio or speech, in a video, by the hearing impaired is provided by means of either text subtitles or sign language gesture...


Automatic Subtitle Generation for Sound in Videos

The last ten years have witnessed the emergence of all kinds of video content. Moreover, the appearance of websites dedicated to this phenomenon has increased the importance the public gives to it. At the same time, some individuals are deaf and occasionally cannot understand the meaning of such videos because no text transcription is available. Therefore, it is neces...


Key Frame Extraction from Videos - A Survey

With the advent of social networking, sharing of multimedia has gained a tremendous amount of importance and is a widely used form of communication worldwide. In the process of discovering knowledge from videos, the challenge is to process a huge amount of information, which is resource-intensive. One way to minimize the cost of computations is to reduce the amount of information that undergoes process...


Tight Frame Approximation for Multi-Frames and Super-Frames

In this thesis, we study a generator for a multi-frame or super-frame generated under the action of a projective unitary representation for discrete countable groups. Examples of such frames are Gabor multi-frames, Gabor super-frames, and frames for shift-invariant subspaces. We show that there exists a unique normalized tight multi-frame (super-frame) generator that has the minimum distance from it. Similar problems are also posed for dual frames, and some ...


Wavelet Frame Based Blind Image Inpainting

Image inpainting has been widely used in practice to repair damaged/missing pixels of given images. Most of the existing inpainting techniques require knowing beforehand where those damaged pixels are, either given as a priori or detected by some pre-processing. However, in certain applications, such information neither is available nor can be reliably pre-detected, e.g. removing random-valued ...


Journal

Journal title: Information

Year: 2021

ISSN: 2078-2489

DOI: https://doi.org/10.3390/info12060233